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TECHNICAL FIELD 

The present invention is directed to the field of electronic commerce, and, 
more particularly, to the field of purchasing opportunity analysis. 

BACKGROUND 

Electronic commerce enables computer users to purchase items, such as 
products and services, on the World Wide Web in a variety of ways using their 
computers. For example, opportunities to purchase items online include: online stores 
that are analogous to brick and mortar stores selling large inventories of items at a fixed 
price, online classifieds that sell a single item at a fixed price, online auctions in which 
users bid on an item, online reverse auctions in which sellers bid to sell an item to a user, 
and online purchase aggregation, in which groups of users aggregate their orders for 
products. 

It can b^ use- ^1 for users to be able to identify similar purchasing 
opportunities, such as different opportunities to purchase the same or similar items. In 
the case of one-at-a-time purchasing opportunities such as online classifieds, a user may 
wish to find additional opportunities to purchase the same item where the user needs 
more than one of the item. For all kinds of purchasing opportunities, a user may wish to 
(a) price-shop for the same item, or (b) identify similar items that the user might also 
wish to purchase, either as an alternative to the first item or in addition to it. 

Unfortunately, purchasing opportunities are generally not well-organized 
with respect to these objectives. Indeed, it is common for purchasing opportunities to be 
organized into general item categories, such as "office supplies" vs. "kitchen 
implements." In some cases, purchasing opportunities are not organized at all, 
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Accordingly, a technique for automatically identifying opportunities to 
purchase similar items on the World Wide Web would have significant utility. 



BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 is a high-level block diagram showing the environment in which 
the facility preferably operates. 

Figure 2 is a display diagram showing a representative window displayed 
by the facility. 

Figure 3 is flow diagram showing the steps preferably performed by the 
facility in order to score the similarity of other purchasing opportunities to a distinguished 
purchasing opportunity. 

Figure 4 is a data structure diagram showing a purchasing opportunity table 
preferably used by the facility. 

DETAILED DESCRIPTION 

In a preferred embodiment, a software facility for automatically identifying 
similar purchasing opportunities ("the facility") is provided. For example, for an initial 
auction, the facility can preferably identify auctions, or purchasing opportunities of other 
types, that offer the same or a similar item, similar prices or other terms, etc. 

The facility preferably uses descriptive information about an initial 
purchasing opportunity to identify purchasing opportunities that are similar to the initial 
purchasing opportunity. The descriptive information may include information describing 
the item offered in the initial purchasing opportunity, as well as information describing 
other terms of the purchasing opportunity, such as price, availability, seller identity or 
location, purchasing opportunity type, as well as a variety of other types of information 
about the initial purchasing opportunity. 

The facility first identifies key words occurring in the descriptive 
information for the initial purchasing opportunity whose occurrence tend to best 
differentiate the initial purchasing opportunity from others. The facility attributes a score 
to each key word quantifying this tendency. In a preferred embodiment, the facility 
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identifies key words ana attributes scores using the inverse document frequencies of the 
terms occurring in the descriptive information for the initial purchasing opportunity. The 
inverse document frequency of a term measures the fraction of occurrences of the term 
among the descriptive information for all of the purchasing opportunities that occur in the jeA^ 
5 descriptive information for the initial purchasing opportunity. For example, if a particular ^ 0 
term occurred three times in the descriptive information for the initial purchasing fo*>\ 
opportunity and occurred 100 times in the descriptive information for all of the 
purchasing opportunities, that term would have an inverse document frequency of 3%, or 
.03. 

io The facility searches the descriptive information for all of the purchasing 

opportunities to determine, for each key word, which purchasing opportunities' 
descriptive information contains the key word. In one embodiment, the facility does so 
by conducting a separate search for each key word, producing, for each key word, a 
search result containing a list of all of the purchasing opportunities whose descriptive 

15 information contain the key word. The facility then generates similarity scores for at 
least some of the other purchasing opportunities by, for each such purchasing 
opportunity, summing the scores of key words that occur in the purchasing opportunities. 

The purchasing opportunity scores may then be used to order the 
purchasing opportunities based upon their level of similarity to the initial purchasing 

20 opportunity. For example, the facility may order other purchasing opportunities in 
decreasing order of their similarity to the initial purchasing opportunity. The purchasing 
opportunity scores may also be used to subset the purchasing opportunities based upon 
their level of similarity to the initial purchasing opportunity. For example, the facility 
may subset the other purchasing opportunities to include only the 20 purchasing 

25 opportunities that are the most similar to the initial purchasing opportunity, or may subset 
them to include only the purchasing opportunities that are judged to have at least a 90% 
likelihood of offering the same item that is offered in the initial purchasing opportunity. 
By displaying such ordered and/or subsetted lists of purchasing opportunities, the facility 
enables a user to purchase the same item from multiple sources, price-shop for an item 

30 among different sellers and/or purchasing opportunity types, or identify similar items that 
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the user might also wish to purchase, either as an alternative to the first item or in 
addition to it. 

Figure 1 is a high-level block diagram showing the environment in which 
the facility preferably operates. The block diagram shows several client computer 
5 systems, such as client computer systems 110, 120, and 130. Each of the client computer 
systems has a web client computer program for browsing the World Wide Web, such as 
web clients 111, 121, and 131. The client computer systems are connected via the 
Internet 140 to a server computer system 150 hosting the facility. Those skilled in the art 
will recognize that client computer systems could be connected to the server computer 
10 system by networks other than the Internet, however. 

The server computer system 150 contains a memory 160. The memory 160 
preferably includes auction software 161 incorporating the facility 162, as well as a web 
server computer program 163 for delivering web pages in response to requests from web 
clients. ^While items 161-163 are preferably stored in memory while being used, those 
15 skilled in the art will appreciate that these items, or portions of them, may be transferred 
between memory and a persistent storage device 172 for purposes of memory 
management and data integrity. The server computer system further contains one or more 
central processing units (CPU) 171 for executing programs, such as programs 161-163, 
and a computer-readable medium drive 173 for reading information or installing 
20 programs such as the facility from computer-readable media, such as a floppy disk, a 
CD-ROM, or a DVD. 

While preferred embodiments are described in terms in the environment 
described above, those skilled in the art will appreciate that the facility may be 
implemented in a variety of other environments including a single, monolithic computer 
25 system, as well as various other combinations of computer systems or similar devices 
connected in various ways. 

To more fully illustrate its implementation and operation, the facility is 
described in conjunction with an example. 

Figure 2 is a display diagram showing a representative window displayed 
30 by the facility. Window 200 contains descriptive information 210 about a current 
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auction for Archie and Jughead salt and pepper shakers. Information 210 includes such 
fields as name, current bid, last bid, opening time, closing time, number of bids, starting 
bid, amount, accepted payment methods, detailed item description, photo, seller identity, 
shipping fee, customer service policy, and item category. In additional embodiments, 

5 other types of data may be included in information 210. The window further includes 
controls 221 and 222 for bidding on the current auction. A user can use these controls to 
bid on the auction by typing a bid amount in bid amount field 221, then clicking on "bid" 
button 222. The window further includes a "see similar auctions" button 223 for 
displaying information about auctions for similar items. The user may use this control to 

10 display information about similar auctions by clicking on the "see similar auctions" 
button 223. When the user clicks on button 223, the facility analyzes other auctions to 
identify similar ones. In additional embodiments, the facility can identify auctions for the 
same item, or purchasing opportunities of various types for the same item or similar 
items. 

15 Figure 3 is flow diagram showing the steps preferably performed by the 

facility in order to score the similarity of other purchasing opportunities to a distinguished 
purchasing opportunity, such as the auction described by information 2 10. A In step 301, 
the facility selects terms in the description of the distinguished item or items having the 
highest inverse document frequencies. In various embodiments, terms are selected from 
20 various combinations of fields 210. Steps 302-304 constitute a loop that is repeated for 
each term selected as a key word in step 301. In step 303, the facility conducts a search 
for purchasing opportunities whose item descriptions contain the selected term. In step 
304, if additional selected terms remain, then the facility continues in step 302 to process 
the next selected term. Steps 305-308 constitute a loop that is repeated for each 
25 purchasing opportunity found in at least one of the searches conducted in step 303. In 
step 306, the facility determines which selected terms occur in the item description for the 
purchasing opportunity. In step 307, the facility determines a score for the purchasing 
opportunity by summing the inverse document frequencies of any of the selected terms 
that occurred in the item description for the purchasing opportunity. In step 308, if 
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additional purchasing opportunities remain, then the facility continues in step 305 to 
process the next purchasing opportunity. After step 308, the steps conclude. 

Figure 4 is a data structure diagram showing a purchasing opportunity table 
preferably used by the facility. Each row in the purchasing opportunity table corresponds 

5 to a different purchasing opportunity. As shown, each row in the table contains two 
fields: a purchasing opportunity identifier field 411 contain an identifier that uniquely 
identifies the purchasing opportunity of the row, and an item description field 412 that 
contains a description of the item offered in the purchasing opportunity. For example, in 
row 401, it can be seen that the Archie and Jughead salt and pepper auction shown in 

10 Figure 2 has purchasing opportunity identifier "100976591," and item description 
"Archie and Jughead ceramic salt and pepper shakers, in original package, perfect 
condition." 

p While the descriptive information used by the facility is shown in a single 

y£ table having only two columns, is shown as corresponding only to the "Detail" field 
W 15 shown in Figure 2, those skilled in the art will appreciate that this information could be 
pi distributed across a large number of different tables, or even stored in a, completely 
J different form. Such tables may contain additional columns other than those shown in 
^ Figure 4 and may be stored in compressed or otherwise optimized form. Further, the 
y descriptive information used by the facility may include additional information associated 
g 20 with each purchasing opportunity in addition to or instead of the contents of the "Detail" 
S field. 

In one embodiment, the purchasing opportunity table is derived directly 
from electronic records of purchasing opportunities managed by the operator of the 
facility. In an additional embodiment, the purchasing opportunity table is compiled by 
25 systematically retrieving and analyzing information describing purchasing opportunities 
managed by others. 

In order to find purchasing opportunities similar to the auction shown in 
Figure 2, the facility determines the inverse document frequencies of the words, or 
"terms," occurring in the item description for the auction in Figure 2, which is referred to 
30 as the "distinguished purchasing opportunity." The result is shown below in Table 1. 
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• # 



Term From Item 
Description for 
uisiinguisnea 
Purchasing 
Opportunity 


Inverse Document 
Frequency in 
Distinguisneu 

Item 
Description 


Archie 


04123692 


Qtin 
aXYKX 


000004? 1 




0652191 1 

. UUJZ,J 711 


ceramic 


00127635 


calf 


0001 S761 

.UU7 1JZ.UJ 


pcppci 


008Q7750 


shakers 

JJ.llU.Wi. 


00904123 


in 


00000517 


ori crinal 


00008216 


nackaee 


00005972 


perfect 


.00004257 


condition 


.00004924 



TABLE 1 



The facility uses these inverse document frequencies to identify key words 
among the terms occurring in the item description for the distinguished purchasing 
opportunity. In various embodiments, the facility (a) selects any terms having an inverse 
document frequency higher than a fixed threshold; (b) selects a fixed number of the terms 
having the highest inverse document frequencies; and (c) selects a fixed percentage of the 
terms having the highest document frequencies. In the example, the facility utilizes an 
inverse document frequency absolute threshold of .0010000. Accordingly, the facility 
selects the following terms having inverse document frequencies greater than .0010000 as 
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key words, as indicated in Table 1 by the underlining of these terms: Archie, Jughead, 
ceramic, salt, pepper, shakers. 

After identifying the terms listed above as key words, the facility searches 
the purchasing opportunity table for purchasing opportunities whose items descriptions 
5 contain one or more of the key words. In this regard, the facility determines that the item 
descriptions for the purchasing opportunities having the following purchasing opportunity 
identifiers contain one or more of the key words: 219765112, 342197558, and 
919822507. 

The facility next generates a score for each of these three purchasing 
10 opportunities by summing the inverse document frequencies of any key words occurring 
in the item description for each purchasing opportunity. The results are shown below in 
Table 2. 



Found 


Term From Item 


Inverse Document 


Purchasing 


Description for Found 


Frequency in 


Opportunity 


Purchasing 


Distinguished Item 


Identifier 


Opportunity 


Description 


219765112 


ceramic 


.0012735 






.0012735 


342197558 


salt 


.00915263 




pepper 


.00897250 




shakers 


.00904123 




Archie 


.04123692 




Jughead 


.06523911 






.13364239 


91982507 


Archie 


.04123692 




Jughead 


.06523911 






.10647603 



TABLE 2 



15 The calculated purchasing opportunity scores may be used in a variety of 

ways. For example, the purchasing opportunity scores may be used to order a list of the 
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found purchasing opportunities from most similar to least similar. In such a list, 
purchasing opportunity 342197558 would be first, purchasing opportunity 919822507 
would second, and purchasing opportunity 219765112 would be third. 



purchasing opportunities in a variety of ways. For example, the purchasing opportunity 
scores may be used to subset the list of found purchasing opportunities to include only 
(a) purchasing opportunities whose scores exceed an absolute threshold; (b) a fixed 
number of purchasing opportunities having the highest scores; or (c) a fixed percentage of 
the found purchasing opportunities having the highest scores. In terms of the example, to 
identify a group of similar purchasing opportunities, the facility preferably employs an 
absolute threshold of .05000000, thereby including purchasing opportunities 342197558 
and 919822507 and excluding purchasing opportunity 219765112. In order to identify 
purchasing opportunities having a high likelihood of offering the same item as the 
distinguished purchasing opportunity, the facility preferably applies an absolute threshold 
of .11000000, thereby including purchasing opportunity 919822507 and excluding 
purchasing opportunities 219765112 and 342197558. 



facility may be straightforwardly adapted or extended in various ways. For example, the 
facility may be applied to purchasing opportunities of every conceivable sort. Further, 
the facility may employ a variety of other approaches to assessing term and purchasing 
opportunity scores. While the foregoing description makes reference to preferred 
embodiments, the scope of the invention is defined solely by the claims that follow and 
the elements recited therein. 
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The purchasing opportunity scores can further be used to subset the found 



It will be appreciated by those skilled in the art that the above-described 
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